DO NOT MERGE: 2bit llm debug changes#26960
Draft
quic-calvnguy wants to merge 4 commits intomicrosoft:mainfrom
Draft
DO NOT MERGE: 2bit llm debug changes#26960quic-calvnguy wants to merge 4 commits intomicrosoft:mainfrom
quic-calvnguy wants to merge 4 commits intomicrosoft:mainfrom
Conversation
…sion changed the amount of memory allowed.
[QNN EP] Add QnnIr backend support
### Description
Enable ORT QNN-EP to use QnnIr backend to generate
DLC in QNN-EP path
### Motivation and Context
QnnIR backend prepare QNN IR Graph and save it in DLC container file.
The DLC container file can be used with DLC based debugging tools
packaged in Qualcomm's QAIRT SDK
[QNN-EP] EP can serialize QNN graph to DLC
* Don't silently fallback to QnnCpu when QnnSaver was explicitly selected as the execution backend.
* Add support for serializing to .dlc via the QnnIr backend.
* Minor fixes.
Review feedback:
* Improve names of two functions
* Add documentation of the new EP options to onnxruntime_c_api.h
Enable DLC generation for UDO
[QNN-EP] Enable Optrace from QNN into QNN EP
- Add optrace profiling level
- Add profiling to compose graph
- Add new qnn system profile serializer class
- Add API versioning safeguards
- Add backwards compatibility for QNN API < 2.28.1
- Use QNN System Profile API for QNN API >= 2.28.1
- Check for log file at end of profiling unit test
- Ensure system libs are loaded when profiling is enabled
add optrace option to perf test
Add option to enable/disable rpc polling
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Description
Motivation and Context